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UPCROSSING INEQUALITIES FOR STATIONARY SEQUENCES 

AND APPLICATIONS 

By Michael Hochman 

Hebrew University of Jerusalem 

For arrays (Sij)i<i<j of random variables that are stationary in 
an appropriate sense, we show that the fluctuations of the process 
("Si,n)5S=i can be bounded in terms of a measure of the "mean sub- 
additivity" of the process (Sij)i<i<j . We derive universal upcrossing 
inequalities with exponential decay for Kingman's subadditive er- 
godic theorem, the Shannon-MacMillan-Breiman theorem and for 
the convergence of the Kolmogorov complexity of a stationary sam- 
ple. 



1. Introduction. Let us say that a sequence (X n )^L 1 of real numbers has 
k crossings (or upcrossings) of an interval [s,t] if there are indices 



1 < h < ji < ii < J2 < ■ • • < ik < 3k 



such that Xi m < s and Xj m > t. Allowing X n to be random, it easily follows 
that limX n exists a.s. if and only if, for every interval of positive length, the 
probability of infinitely many crossings of the interval is 0. 

There are a number of classical limit theorems in probability that can 
be formulated and proven in this way, the best known of which is Doob's 
upcrossing inequality for L 1 martingales [6]: if (S n )^ = i is an L 1 martingale, 
then, for s < t, 

P((5 n )^L 1 has k upcrossings of [s,t]) < — -j^— 

(see also Dubins [7]). A similar inequality was proven by Bishop for the 
time averages S n = - J22=i X n of an L 1 stationary process (X n )™ =1 [1, 2]. 
Assuming nonnegativity of the process instead of integrability, Ivanov [8] 
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proved the following beautiful result: for every s < t, 



(1.1) F((S n )^ = i has k upcrossings of [s,t]) < 




(see [4]; for related results, see Jones et al. [9], and Kalikow and Weiss [10]). A 
remarkable aspect of these inequalities is that they hold universally: except 
for trivial normalization, they do not depend on the process in question. Nei- 
ther martingales nor ergodic averages admit universal rates of convergence 
and it is all the more surprising that such general bounds for the fluctuations 
exist. 

In this paper, we establish a general upcrossing inequality for certain 
sequences associated with stationary processes, in terms of a certain measure 
of "mean subadditivity" of the process. We consider arrays {Sij)\<i<j of 
random variables, although we will usually identify the ordered pair 
that indexes Sij with the integer interval = [i,j] DN, and if U = is 
such an interval, we write Sjj for Sij. We assume the process to be stationary 
with respect to translation of the indexing intervals, that is, that for every 
m G N, one has 

( S [i;j])i<i<j = (<%+m,j+m])i<i<j hi distribution. 

One very general way to obtain such arrays is by applying a function to 
samples of stationary processes: if (X n )^ =1 is stationary and g is any function 
defined on finite sequences, then Sly = g(Xi, Xi + i, . . . , Xj) satisfies these 
assumptions. 

Let / C N be an interval and 6 > 0. We say that a collection I\, . . . , I r of 
intervals 5 -fills I if all of the intervals are contained in I and |J\|J Jj| < <5|/|, 
where | • | denotes cardinality. 

Theorem 1.1. Suppose that ($y)i<i<j is stationary in the above sense. 
Let s <t and < 6 < j. Then, for every k, 

p / (Si^ has k \ 
\ upcrossing of [s, t] J 

(there exists n> k such that Si^ n > t \ 
and [1 ;n] can be 5-filled by disjoint 
intervals Vi,...,V r satisfying Svj < s J 

The constants c and < p < 1 depend only on 6 (but not on the process or 
on s,t). 

In applications, one optimizes over S to get a bound for the left-hand side 
which is often independent of the process. Theorem 1.1 is effective and the 
constants may be computed explicitly, although they are surely not optimal. 
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This inequality cannot be reversed and in Section 2, we give a simple example 
in which the left-hand side decays exponentially and uniformly for a certain 
class of processes, but the right-hand side can decay arbitrarily slowly. 

Theorem 1.1 can be generalized in several ways. It remains valid when 
one starts with a "two-sided" stationary array (<Sij)-oo<i<j'<oo and sets 
S n = S- ntn , and there is a version for S n = Sy n , where V{ is an arbitrary 
increasing sequence with £ V\. It can also be extended to the multidimen- 
sional setting, where the process is indexed by cubes instead of segments. 
These versions require minor modifications of the proof we give. 

Our first application is to Kingman's subadditive ergodic theorem. Let 
(X mjn )i< m < n be stationary, in the sense described above, and subaddi- 
tive, that is, X^^n < m + X m+ i^ n whenever k < m < n. Kingman's the- 
orem states that under some integrability conditions, if X m „ n is station- 
ary and subadditive, then ^X\^ n converges almost surely. As examples of 
this situation, consider X m>n = SiL m ^i> where Yj is a stationary process or 
X m , n = \\Z m , ...,Z n \\, where Z% is a stationary sequence of operators. 

Polynomial decay for upcrossings in Kingman's theorem for integrable 
processes was proven by Krawczak [11]. We establish an exponential version 
of this, with integrability replaced by a boundedness condition: 

Theorem 1.2. Let {X m>v )\< m < n be stationary and subadditive. Suppose 
that X\ t i < M a.e. for a constant M . Then, for every s <t, 

¥((S n )^ = i has k upcrossings of [s,t]) < c • p k 
for constants < p < 1 and c that depend only on s,t and M . 

We next turn to the convergence of Shannon information of samples. 
Given a finite- valued process (Xj)^ , the information of the sample X\ , . . . , X, 
is the random variable 

I(X 1 ,...,X n ) = -logF(X 1 ,...,X n ). 

Here, for a fixed sequence £ = £i, . . . we write P(£) for the probability of 
observing this sample, so P(-X"i, • ■ • ,X n ) is the probability of observing the 
sample that was actually observed. 

The Shannon— MacMillan— Breiman theorem [13] is one of the fundamental 
theorems in information theory, asserting that —I(X\, . . . ,X n ) converges 
almost surely. See, for instance, [12]. 

Theorem 1.3. Suppose that (X n )^ =1 is a 0, 1-valued stationary process. 
Let S n = -I(Xi, . . . ,X n ). Then, for every s <t, 

P((<5n)^=i has k upcrossings of [s,t]) < c • p k 
and the constants < p < 1 and c depend on s,t. 
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The same result holds for processes with a finite number r of symbols, 
but the constants will then also depend on r. We note that there is no 
universal rate of convergence for this limit and Theorem 1.3 seems to be 
the first effective proof of its convergence. It is interesting that the classical 
proofs of the convergence of —I(X\, . . . , X n ) rely on the ergodic theorem and 
the Martingale theorem, but despite upcrossing inequalities being known for 
both of these, it is unclear how to combine them to deduce Theorem 1.3. 

Bishop's upcrossing proof of the ergodic theorem was part of a larger pro- 
gram of Bishop's to "constructivize" mathematical analysis. As Theorem 1.1 
is effective, it can probably be adapted to this setting, leading to constructive 
proofs of the Kingman and Shannon-MacMillan-Breiman theorems. 

Another closely related result involves algorithmic complexity rather than 
Shannon information. For a finite string x of O's and l's, let n(x) denote the 
Kolmogorov complexity of x, sometimes referred to as the minimal descrip- 
tion length of x (see Section 4). Although k(-) is not formally computable, it 
has been extensively studied as a nonstatistical measure of complexity and 
is closely related to Shannon's theory of information; see, for example, [5]. 
The following result is therefore an analog of Theorem 1.3; the existence of 
the limit was shown by Brudno [3]. 

Theorem 1.4. Suppose that (X n )^ =1 is a stationary 0,1-valued process. 
Then, writing S n = ——k{X\, . . . ,X n ), for every s <t, we have 

^((Sn)^! has k upcrossings of [s,t]) < c • p k 
for constants c and < p < 1 that depend only on s,t. 

The rest of the paper is organized as follows. First, we use Theorem 1.1 
to derive the applications. In Section 2, we derive the result on Kingman's 
theorem, and show that the inequality in Theorem 1.1 cannot be reversed. 
In Section 3, we prove the upcrossing inequality for the Shannon— McMillan- 
Breiman theorem and in Section 4, we derive the inequality for Kolmogorov 
complexity. In Section 5, we reduce Theorem 1.1 to a combinatorial lemma 
whose proof is given in Section 6. 

2. The subadditive ergodic theorem and an example. In this section, we 
discuss the relation between Theorem 1.1 and ergodic theorems. 

Proof of Theorem 1.2. Suppose that V±, . . . , V r are disjoint subinter- 
vals of [1; n] such that mIv; < s. Let U = [0, 1] \ Ui=i Vi- By subadditivity, 

n n \Vi\ 'H'r n n 

i 41 jet/ 
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(in the last inequality, we used stationarity to get Xjj < M). Thus, if [l',n] 
is (^-filled by the Vi, then \U\ <5n, so 

-X ln <(l-S)s + SM 
n 

and if 5 = , then it impossible that —X\^ n > t. Hence, for this 5, when 
we apply Theorem 1.1 with the sequence Sij = j_\ +1 Xij, the event on the 
right-hand side of inequality (1.2) is empty. The theorem follows. □ 

Next, we show that decay of the right-hand side in Theorem 1.1 is not 
necessary in order to get fast decay on the left. Let (X n )^ =1 be a stationary 
process with values in [—1, 1] and let 

so Si tn is obtained by repeating elements from the sequence of ergodic av- 
erages {^{X± + • • • + X n ))^ =1 . The latter sequence obeys an upcrossing in- 
equality which does not depend on X n ; thus, the former sequence also does. 
However, the following proposition shows that one cannot obtain this from 
Theorem 1.1. 

Proposition 2.1. Let 5> 0, and 

(there exists n> k such that Si t7l > | 
and [l,n] can be 5 -filled by disjoint 
intervals V\ , ■ ■ ■ , V r satisfying Sy d < - 

Then, for arbitrarily large k, there are processes for which Pfc > 1/6 and, 
in particular, the convergence pk — > is not uniform as the process (X n ) is 
varied. 

Proof. Fix 5 > 0. Let n > 1/5 and let (Xi)^ be the process whose 
unique sample path, up to translation, is the sequence with period 2n in 
which blocks of l's and — l's of length n alternate. Set k = n 2 . It is easily 
verified that (a) the probability that the first 2n/3 symbols of a sample are 
l's is 1/6 and Si^, in this case, is > 1/2; and (b) if X\ = 1, then, taking j to 
be the first index with Xj = — 1, we have Sj t k < — 1/2 and [1; k] is (5-covered 
by \j\k\. Thus, for each square k, there are processes for which p^ > 1/6. □ 

3. The Shannon McMillan Breiman theorem. In this section, we prove 
Theorem 1.3. Fix s < t and a parameter 5 > 0. Set 

= ~ . , -, I{Xi, ...,Xj). 

3 — i + 1 
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In order to apply Theorem 1.1, for each n G N, we wish to bound the prob- 
ability of the event 

g _ ( Si :n > t and [1, n] can be 5-filled by disjoint 1 
n \ intervals V\ , . . . , V r satisfying Svj < s J ' 

Since 

S itj >t <=> F(X i ,...,X j )<2- t ^- l+1 \ 
S^Ks F(X i ,...,X j )>2^- l+1 \ 
we have the trivial bound 

, , x o-tnn ( words w £ {0, l} n which can be <5-filled with\ 
[ ' [ n) ~ # \ disjoint words v satisfying P(v) > 2~ s ^ ) ' 

where i(v) denotes the length of v. 

To estimate the right-hand side of (3.1), we note that each word w that 
is counted on the right can be constructed as follows: 

(1) choose a subset / C [1; n] of size < Sn; 

(2) choose the symbol w(i) for each iE I; 

(3) for each maximal interval JC [l;n] \ /, choose a word v = w\j with 
F(v) > 2~ st{ - v \ 

To bound the number of words produced in (l)-(3), we bound the number 
of choices at each step. In step (1), we have < 2 n/l ( <5 )+ ( logn ) choices, where 
h(x) = — xlogx — (1 — x) log(l — x) (this is a standard consequence of Stir- 
ling's formula). In step (2), we have at most 2 <5n choices. Finally, in step (3), 
let Ji, . . . , J r be the maximal intervals in [1; n] \ I. The number of distinct 
words v of length | Jj| and satisfying ¥(v) > 2~ s \ Ji \ is clearly bounded by 
2 s \Ji\ t so the number of ways to choose such words with lengths | J\\, . . . , \ J r \ 
is at most 

r 

J~[ 2 S I^I = 2EI=i s \ J *l < 2 sn . 
i=l 

It follows that the number of words counted on the right-hand side of equa- 
tion (3.1) is 

< 2(s+h(S)+8)n+o(logn) 

so 

F(B n ) < 2~('^ s ~ h ( 5 )~ <5+0 ( 1 )) n . 

Hence, if 5 is small enough in a manner depending on s,t, then this bound 
is summable and gives 

'|J B n \ <c-p k 

n>k ' 
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for constants c and < p < 1 depending only on s,t. Since \J n>k B n is the 
event on the right-hand side of inequality (1.1) in Theorem 1.1, this com- 
pletes the proof of Theorem 1.3. 

4. Kolmogorov complexity. The Kolmogorov complexity k(x) of a 0, 1- 

valued string x is defined as follows. Fix a universal Turing machine U and 
let x* be a string of minimal length such that, when U is run on input x* , 
the output is x. Then, k(x) is the length of x* . Although k(-) depends on the 
universal machine U, changing U only changes «(•) by an additive constant. 

In this section, we prove Theorem 1.4. Note that S m ^ n = ^fc(x m , . . . , x n ) is 
not strictly subadditive, so Kingman's theorem does not apply; nonetheless, 
we can exploit the fact that it is "almost subadditive." 

Proposition 4.1. Let (X n ) ( £ =1 be a 0,1-valued stationary process and 
set S n = -k(X\, . . . ,X n ). Then, for every s <t, there is a 5 > and an N 
such that the event 

{there exists n> k such that Si^ n >i ) 
and [l,n] can be 5 -filled by disjoint > 
intervals V\ , . . . , V r satisfying Sy- < s J 

is empty for all k> N . 

Proof. We reason as in the previous section. Suppose that x S {0, l} n 
and [1; n] can be <5-filled by a disjoint collection of intervals {V\, . . . , V m } with 
jyj-\K(x\v k ) < s. We can encode x by describing the choices of the three-step 
process outlined in the previous section. In step (3), we encode the pattern 
x\y m by writing down the algorithm that produces it. Thus, the estimate 
from the previous section shows that the number of bits required is 

h(S)n + n + k(x\ v J < (s + h(S) + 5)n + o(logn). 

We only require a constant-length program to extract the string x from 
this encoding. We also require an overhead of 0((h(5) +5)n) to encode this 
information in a self-punctuating way. Thus, 

k(x) < o(logn) + (s + C'{h{5) + S))n 

and if 5 is small enough (in a manner depending on s,t), then this implies 
k(x) < t, once n is large enough (how large n must be depends on C,C, 
which, in turn, depend on the Turing machine we are using, but is indepen- 
dent of s,t,5 and the process). The proposition follows. □ 

Theorem 1.4 now follows from Theorem 1.1. 
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5. Reduction of Theorem 1.1 to a covering lemma. The purpose of this 
section is to reduce the proof of Theorem 1.1 to a combinatorial statement 
about intervals, related to the effective Vitali covering lemma of Kalikow 
and Weiss [10]. This lemma is stated below in Lemma 5.1, but its proof is 
rather technical and we defer it to Section 6. 

Let {Sij)i<i<j be stationary, in the sense discussed in the Introduction. 
Fix s < t, a parameter 5 > and an integer k. For i G N, we define the events 

A = {(Si,i+n)£°=i has k upcrossings of [s,t]}, 

g _ f There is an n > k such that Si t i+ n > t, and [i, i + n] can be 1 
* \ (5-filled by disjoint intervals V\, . . . ,V r satisfying Sy 3 < s J ' 

By stationary, P(Ai) = f(Aj) and P(Bi) = F(Bj) for all We abbreviate 
A = A±, B = B\. Theorem 1.1 is then equivalent to 

P(A) < cp k + P(J3) 

for constants < p < 1 and c that depend only on s,t. 
The proof proceeds as follows. Fix a large iV and let 

Af = {(Si^ +n )n = i has k upcrossings of (s,t)}. 

Since A = \JAi, it suffices to show that, for each N, 

<cp k + F(B) 

with c, p independent of N. Fix an integer R much bigger than N (we will 
eventually take R — ► oo) and let / C {1, . . . , R} be the random set of indices 
defined by 

I = {i€ [l;R):Af occurs}. 
By stationarity of Sij, we have 

P(^) = iE^) = >(Ex.»)=>|7|. 

1=1 \'t=l / 

We proceed to estimate the expected size of /. We divide I into two parts: 

Iq = {i G / : Bi occurs} and I\ = I\Iq. 

Since ^E(|/ |) = P(-B), it suffices to show that ±\h\ < cp k . 

By definition, for each i G I\, there is a (random) sequence of k pairs of 
nonempty intervals Ui(l) C V^(l) C • ■ • C Ui(k) C Vi{k) whose left endpoint 
is i and with length < N, and such that Sjj.^ < s and Sy^m) > t f° r 1 — 
m < k. 

We now pass to a subsequence of the U^s and V^'s by performing two 
refinements of the sequence. First, clearly, \Ui{m + 1)| > |Vi(m)| > \Ui(m)\ 
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because Su i+1 / m ), Sjjj^ / Sy.^, and because of the given inclusions. Thus, 
\Ui(m)\ > 2(m — 1). If we delete the first feo = \k/2\ +1 pairs, then we are 
left with a sequence Ui(ko + 1) C Vi(ko + 1) C • ■ • C Ui(k) C Vi(k) of at least 
k' = [k/2\ — 1 pairs of intervals, all of which are of length greater than k. 

Second, notice that if \Vi(m) \ /\Ui(m)\ <1 + S, then i G Jo since Ui(m) 
would (5-fill Vi(m). Thus, for i G I\, we also have |Vi(m)|/|J7j(m)| > 1 + 8. 
Choose q so that 

that is, q = [log(72/5 2 )/log(l + 5)] +1. By deleting the intervals Ui(J), Vi(J + 
1) when j / (modq) and renumbering the remaining ones, we are left 
with a sequence of k" > \_(k' — l)/q\ pairs of intervals Ui(l) C Vi(l) C • • • C 
Ui(k") C all having length > and satisfying ^ < s, ^y/^ > ^ 

and which, additionally, satisfy the growth condition |Vi(m)| > |||C/j(m)| for 
every 1 < m < k" . 

We now apply a combinatorial result whose proof we defer to Section 6. 



Lemma 5.1. Let e < 1/4. Suppose that J C N is finite and that for each 
j £ J, we are given a sequence of intervals Uj(l) C Vj(l) C • ■ • C Uj(L) C 
Vj(L) with left endpoint j and satisfying |Vj(n)| > p-|C/j(n)|. Suppose that 
none of the Vj(n) 's can be Qe-filled by a disjoint collection ofUi(m) 's. Then, 



\J\< 1 + 



-(L-l)/(logl/e) 



u vm 



We apply the lemma to our situation with J = I±, L = k" and e = 5/6. The 
hypothesis is satisfied by definition of I\ . It follows that there are constants 
c > and < p < 1 depending only on 5 (hence on s, t), such that 

\h\<c-p~ k - \JViik") 
ieii 

<c- p~ k (N + R) 

and the last inequality holds because (Jie/i Vi(k") C [1; JV + i2], We thus have 



—El/I = —El 7 1 + —El < 



+ c-p~ k 1 + 



and the proof of Theorem 1.1 is complete by taking R — > cxo. 



10 



M. HOCHMAN 



6. Proof of Lemma 5.1. The remainder of this paper is devoted to the 
proof of Lemma 5.1. Some of the statements below are standard; we supply 
proofs for completeness. Others parts of the argument are related to the 
effective Vitali covering lemma from [10]. See [12, 14] for other examples of 
covering lemmas in probability and ergodic theory. 

We say that a collection of segments is disjoint if its members are pairwise 
disjoint. The following is a version of the classical Vitali covering lemma: 



Lemma 6.1. If V is a collection of intervals, then there is a disjoint 
subcollection V'CV with \ [j V'\ > | U V|/2. 

Proof. Let V' C V be a minimal collection satisfying U V' = U V. Order 
the intervals in V' by their left endpoints, say V' = {V±, V2, ■ ■ ■ , V m }. Then, 
the subsequence consisting of intervals with even indices is disjoint, similarly 
for the subsequence with odd indices. One of these must cover at least half 
of V. □ 



For e > and an interval U = [a; b] , the e-blowup of U is 

U £ = [a-e\U\,b + e\U\]nZ. 

Note that U ^U £ and \U £ \ < (1 + 2e)|£7|. For a collection U of intervals, we 
write U £ = {U £ :U eU}. 

Lemma 6.2. IfU is a collection of intervals, then \ [_]U £ \ < (l+2e)| {_]U\. 

Proof. Let A = {JU and decompose A into disjoint maximal intervals 
V\, . . . , Vfc, so j \JU\ =^2\Vi\. For each Vj, one clearly has 

U UE £ V ^ 

ueu-.ucv t 



thus 



k k 

|U^|<£ (J U£ <£(l + 2e)N = (l + 2e)|U4 
i=i ueU: ucVi i=i 



□ 



A tower of height M over a finite set / C N is a collection U = {Ui(k) :i£ I, 
1 < k < M} of intervals such that % is the left endpoint of Ui(k) [we shall 
actually only use the fact that i G Ui{k)\ and for each i E I, the sequence 
Ui{l) C Ui(2) C • • • C Ui(M) is strictly increasing. The kth. level of U is the 
collection 



U(k) = {Ui(k):ieI}. 
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Note that the intervals in IA(k) are not necessarily of the same size and, 
although \Ui(k)\ <\Ui(k + l)\, it need not be true that \Ui(k)\ < \Uj(k + 1)| 
if i^j. 

Let lA = {Ui(k)} be a tower of height M over a set /. The e- crust of 
lA is the set of V G tl whose e-blowup is strictly maximal with respect to 
inclusion, that is, 

V = {V G U{M) :if V £ C VF e for some W £lA, then V = W}. 

It is clear that U^^UV e . 

Lemma 6.3. Let < e < 1. Suppose that lA = {Ui(k)} is a tower over I 
of height 2 satisfying 

|C/i(2)|>-||0i(l)l for all ie I 
and V CZY(2) is i/ie e-crust oflA. Then: 

(1) /or each U G W(l) and F G V, if U C\V ^ then U C F e ; 

(2) i/iere existe Cl/(1) and a disjoint V C V snc/i i/iai |U^I — ||U^|j 
(UW) n (U V) = and c (U v £ ) u (UW). 

Proof. Let U elA(l) and V G V with UnV^0.In order to show that 
[7 C F £ , it suffices to show that e\V\ > \U\. Let i G I be such that U = £/j(l) 
and write = Ui(2), the interval "above" J7 in Z^. Since V is in the e-crust, 
we cannot have V s C W £ . Since V n W 7^ 0, this implies that 

(l + e)M>|W|> JiM, 

which gives the desired conclusion. 

To establish (2), apply the Vitali lemma to V to obtain a disjoint family 
VC V with I U V| > 3IUVI. Let 

U = {ueU(i):Un (\Jv) =0}. 

The conclusion now follows from (1). □ 

We want to replace the constant 1/2 in the Vitali lemma with a constant 
close to 1. This can be achieved using the standard trick of applying the 
Vitali lemma to several layers of covers and iteratively disjointifying each 
level in turn. 

Henceforth, all logarithms are taken to base 2. 
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Lemma 6.4. Let < e < 1. Suppose thatlA = {Uj(k)} is a tower of height 
M > 1 + log(l/e) over a set J and 

\Uj{k + l)\>^\Uj{k)\. 
There is then a disjoint subcollection V QU such that \ \JV\ > (1 — 3e)| \JU\. 

Proof. Set U =Li and Vo = 0. For 1 < n < M, we inductively define 
subcollections V n ,U n C ZY(M — n) satisfying: 

(1) V n ,U n QU n -i; 

(2) UW„_iC(UZ4)U(UK); 

(3) (UV„)n(U^n) = 0; 

(4) |UW n |<||UZ4-i|. 

To produce U n ,V n , we apply part (2) of Lemma 6.3 to the top two layers of 
U n -\. Clearly, the collection V = Ui<fc<M^fc * s disjoint and [JU C ((J V e ) U 
(U^a/-i)- By property (4), we have 

|Uwm-i|<(j) |U w p e |U4 

so 

|U v i > |U W | - \U U M-i\ > (l-e)\\Ju\. 
Using the inequality | |JV| > jq^l [JV e \ from Lemma 6.2, we have 

M>^ £ \{Ju\>(l-3e)\{Ju\, 



as desired. □ 



For the remainder of this section, we adopt the following notation. Fix an 
integer L and < e < 1, a finite set I C Z and two towers U = {Ui(k)} and 
V = {Vi(k)} of height L + 1 over / satisfying 

Ui(0) C VKO) C ^,(1) C V-(l) C • • ■ C Ui(L) C Vi(L) 

(for convenience, we start from level 0) and 

> -^■|Z7j(A;)|. 

Note that this ensures a similar growth rate for the substack U. 

Lemma 6.5. Let L > 1 + log(l/e). Then, either there is a V G V(L) 
which can be 6e-filled by a disjoint subcollection ofU, or else \ {jV{L)\ > 

(i + !)IUW(o)|. 
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Proof. Select a maximal disjoint subset W of the e-crust of V. By the 
Vitali lemma and Lemma 6.3, we have 

E w = IUH£^|Uvw|>^|U"C)|- 



We distinguish between two cases. First, if every W £ W satisfies |W \ 
{jU{L — 1)| > e|W|, then, since W is disjoint, we would have 

|U^| > |U^(°)| + |U W \U^(^- !)| 

= |U W (°)|+ E \W\\JU(L-1) 

\\ju(o)\ + x: ei^i 



> 



K 1+ ra))lU"H- 



which gives the desired bound. 

Otherwise, let W £ W be such that \W \\JU(L- 1)| <e|W|. Let 

y = {£/*(&) : < k < L - 1 and ^(L -l)nW^0}, 

Z = {Ui(k) : < k < L - 1 and - 1) C W}. 

By assumption, we know that 

(6.1) \wn({Jy)\>(i-e)\w\. 

We claim that then 

(6.2) |pFn (U- 2 )! ^ (1 — 3e)|W|. 

Suppose that this were not the case. There is then a subset A C W of size 
> e|W|, not covered by 2 and all of whose points are at a distance of at 
least e\W\ from Z \ W. By inequality (6.1), there is some Ui(L — 1) G y \ Z 
that intersects A at a point r. Since U{{L — 1) ^ Z, it must intersect Z\W 
at a point r' . Since Ui(L — 1) contains the interval with endpoints r, r' , we 
have 

|^(£-l)|>|r-r'|>£|PF|, 



so 



\V i {L-l)\>\\U i {L-l)\>2\W\. 

On the other hand, Ui{L — 1) (~)W ^ [because Ui(L — 1) £ y] and since 
Ui(L - 1) C Vi(L - 1), we also have V*(L - 1) n W / 0. Since TV is in the 
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e-crust of V, Lemma 6.3 implies that Vi(L — 1) C W £ , contradicting the size 
bound we got for Vi(L — 1). 

To complete the proof, we apply Lemma 6.4 to the tower Z. We obtain a 
disjoint subcollection of Z (and hence of IX) whose members are contained 
in W and have total size at least (1 — 3e)| U Z\, which, by inequality (6.2), 
is at least (1 — 6e)|W / | , as required. □ 

We can now prove Lemma 5.1, which we rephrase as follows (notice that 
our tower is now numbered starting at and that we have removed the tildes 
from the notation). 

Lemma 6.6. With the above notation, suppose that no interval V G V 
can be de-filled by disjoint elements ofU. Then, 

i, | / e \-LViog(i/s)J|. | 

|uw(o)|<(i+gj |ix4 

PROOF. Set M = log(l/e). It suffices to prove 

|U^| ^ ( 1 + e) M W H 

and it is enough to prove this when L is an integer multiple of M ; write 
L = kM. We proceed by induction on k. The base of the induction is the 
previous lemma. Now, given that it is true for k and given L = {k + 1)M, 
we can apply the induction hypothesis to the restrictions of U, V to levels 
0, 1, . . . , kM. This tells us that 

\\JU(kM)\>(l + f) k \\JU(0)\. 

Now, consider the restriction of the towers to levels kM, kM + 1, ...,(& + 
1)M. Applying the base case, we get 

\\JU((k + 1)M)| > (l + |) \\JU(kM)\. 
Putting these together completes the proof. □ 

As mentioned in the Introduction, everything above can be carried out for 
symmetric intervals and for cubes in Z d ; the proofs generalize easily to that 
case, although the constants change. We note that the Vitali lemma (Lemma 
6.1) requires a different proof in higher dimensions, but this is classical. 

For completeness, we provide the proof of the higher-dimensional analog 
of Lemma 6.2. Consider the case of squares in Z 2 . The e-blowup of a square 
U x V is U £ x V s , which can be written as a disjoint union, 

[/ E xr = ([/x[/)UBiUB 2 UB 3 U B A> 
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where B\ = U e xV\U xV are two vertical strips of width e, B2 = U x V s \ 
U x V are two horizontal strips of height e, B% is the union of two e x e 
squares outside the upper-left and lower-right corners of U x V and B4 is 
the union of two e x e squares outside the upper-right and lower-left corners 
of U x V. To obtain an analog of Lemma 6.2, we must show that if {C/j x Vj} 



is a collection of squares and U- 
then, for each £ = 1,2, 3, 4, 



x = (Ui xy t )uu 



t=l, 2,3,4 ^i,* 



Bj t as above, 



|JS M <2e\JU i xV i 



This follows from the one-dimensional case by decomposing \Ji(Ui x ViU 
Bi t) into the union of the intersection of this set with parallel translates 
of lines. For instance, for t = 1, the intersection of Ui(^i x ViU B^t) with 
each horizontal line is the e-blowup (in the one-dimensional sense) of the 
intersection of (Jj Ui x Vi with that line and so the one-dimensional lemma 
can be applied. Now, sum over all lines. 

The proof for cubes in Z d is proved by induction on the dimension using 
a similar strategy. 
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